Managing Uncertain Mediated Schema and Semantic Mappings Automatically in Dataspace Support Platforms

Authors

  • Nathalie Cindy Kuicheu
  • Ning Wang
  • Gile Narcisse Fanzou Tchuissang
  • De Xu
  • Guojun Dai
  • François Siewe
Abstract

Unlike existing heterogeneous data integration systems, which must be fully integrated before use, a Dataspace Support Platform is a self-sustained system that automatically provides the user with its best-effort results regardless of how integrated its sources are. A Dataspace Support Platform therefore needs to support uncertainty in mediated schemas and in schema mappings. This paper proposes a novel approach to automatically providing reliable mediated schemas and reliable semantic mappings in Dataspace Support Platforms. Our aim is to improve the system's best-effort results by leading it to consider as much as possible of the information available in any connected source. We first extract graph representations from the source schemas. Then, we introduce algorithms that automatically extract a set of mediated schemas from the graph representations and a set of semantic mappings between a source and a target mediated schema. Finally, we assign reliability degrees to the generated mediated schemas and to the semantic mappings: the higher the reliability degree of a given mediated schema or semantic mapping, the more consistent it is with the source. Experimental results show that our system is faster than existing systems and that, although completely automatic, it produces reliable mediated schemas and semantic mappings that are as accurate as those produced by semi-automatic systems.
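The pipeline outlined in the abstract (graph representation, mediated schema extraction, reliability degree) can be pictured with a small toy sketch. Everything in it is an assumption made for illustration and is not the authors' algorithm: the name-based similarity test, the 0.8 threshold, the clustering by connected components, and the reliability measure (the fraction of a source's attributes that are not merged with another attribute of the same source) are all hypothetical stand-ins.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similar(a: str, b: str, threshold: float = 0.8) -> bool:
    """Hypothetical similarity test: compare attribute names as strings."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio() >= threshold


def build_mediated_schema(sources: dict[str, set[str]],
                          threshold: float = 0.8) -> list[set[str]]:
    """Toy mediated schema: connected components of the similarity graph.

    Nodes are attribute names; an edge joins two names judged similar.
    Each component becomes one mediated attribute (a cluster of names).
    """
    attrs = sorted(set().union(*sources.values()))
    parent = {a: a for a in attrs}

    def find(x: str) -> str:
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for a, b in combinations(attrs, 2):
        if similar(a, b, threshold):
            parent[find(a)] = find(b)

    clusters: dict[str, set[str]] = {}
    for a in attrs:
        clusters.setdefault(find(a), set()).add(a)
    return list(clusters.values())


def reliability(mediated: list[set[str]], source_attrs: set[str]) -> float:
    """Assumed reliability degree of a mediated schema with respect to one source.

    Counts the source attributes that sit alone (with respect to that source)
    in their cluster, i.e. that the mediated schema did not merge with
    another attribute of the same source.
    """
    if not source_attrs:
        return 1.0
    kept = sum(1 for cluster in mediated if len(cluster & source_attrs) == 1)
    return kept / len(source_attrs)


if __name__ == "__main__":
    sources = {"s1": {"name", "phone"}, "s2": {"names", "email"}}
    mediated = build_mediated_schema(sources)
    for src, attrs in sources.items():
        print(src, reliability(mediated, attrs))
```

Under this assumed measure, a mediated schema scores lower for a source whenever it collapses two attributes that the source itself keeps distinct, which is one simple way to read "consistent with the source".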

Similar resources

An Iterative Approach to Managing Uncertain Mappings in Dataspace Support Platforms

A DataSpace Support Platform (DSSP) is a self-sustained and self-managed system which needs to support uncertainty among its mediated schemas and its schema mappings. Some approaches for managing such uncertainty by assigning probabilities and reliability degrees to schema mappings have been proposed. Unfortunately, the number of mappings self-generated by a DSSP is usually too large and among ...

Uncertainty in Data Integration and Dataspace Support Platforms

Data integration has been an important area of research for several years. However, such systems suffer from one of the main drawbacks of database systems: the need to invest significant modeling effort upfront. Dataspace Support Platforms (DSSP) envision a system that offers useful services on its data without any setup effort, and improve with time in a pay-as-you-go fashion. We argue that in...

An Improved Semantic Schema Matching Approach

Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Processing (OLAP), data mining, the semantic web [2], and schema integration. The task is to find the semantic correspondences between elements of two schemas. Recently, schema matching has attracted considerable interest in both research and practice. In this paper, we present a new impr...

Data Modeling in Dataspace Support Platforms

Data integration has been an important area of research for several years. However, such systems suffer from one of the main drawbacks of database systems: the need to invest significant modeling effort upfront. Dataspace Support Platforms (DSSP) envision a system that offers useful services on its data without any setup effort, and improve with time in a pay-as-you-go fashion. We argue that in...

Schema Mediation and Query Processing in Peer

P2P Data Management Systems (PDMSs) allow the efficient sharing of data between peers with overlapping sources of information. These sources share data through semantic mappings between peers. In current systems, queries are asked over each peer’s local schema and then translated using the semantic mappings between peers. In this thesis we propose that a mediated schema can benefit PDMSs by all...

Journal title:
  • Computing and Informatics

Volume 32  Issue 

Pages  -

Publication date 2013